A Question-Answering Approach to Key Value Pair Extraction from Form-Like Document Images

نویسندگان

چکیده

In this paper, we present a new question-answering (QA) based key-value pair extraction approach, called KVPFormer, to robustly extracting relationships between entities from form-like document images. Specifically, KVPFormer first identifies key all in an image with Transformer encoder, then takes these as questions and feeds them into decoder predict their corresponding answers (i.e., value entities) parallel. To achieve higher answer prediction accuracy, propose coarse-to-fine approach further, which extracts multiple candidates for each identified question the coarse stage selects most likely one among fine stage. way, learning difficulty of can be effectively reduced so that accuracy improved. Moreover, introduce spatial compatibility attention bias self-attention/cross-attention mechanism better model interactions entities. With techniques, our proposed achieves state-of-the-art results on FUNSD XFUND datasets, outperforming previous best-performing method by 7.2% 13.2% F1 score, respectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Question Answering Approach to Emotion Cause Extraction

Emotion cause extraction aims to identify the reasons behind a certain emotion expressed in text. It is a much more difficult task compared to emotion classification. Inspired by recent advances in using deep memory networks for question answering (QA), we propose a new approach which considers emotion cause identification as a reading comprehension task in QA. Inspired by convolutional neural ...

متن کامل

Question Answering from Large Document Collections

We present a question answering system with a hybrid design, combining techniques from knowledge representation, information retrieval, and natural language processing. This combination enables domain independence and robustness in the face of text variability, both in the question and in the raw text documents used as knowledge sources. We describe the specific design of our current prototype,...

متن کامل

A Question Answering Approach for Emotion Cause Extraction

Emotion cause extraction aims to identify the reasons behind a certain emotion expressed in text. It is a much more difficult task compared to emotion classification. Inspired by recent advances in using deep memory networks for question answering (QA), we propose a new approach which considers emotion cause identification as a reading comprehension task in QA. Inspired by convolutional neural ...

متن کامل

A Multi-agent Approach to Question Answering

In this paper we present a multi-agent approach to question answering for the Portuguese language. Our proposal is composed by three modules: (1) document and query processing; (2) ontology construction; and (3) answer generation. Each module is composed by multiple cooperative agents which adopt distinct strategies to generate its outputs and cooperate to create a global result. This approach ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i11.26516